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ABSTRACT 

We present the first optimal power spectrum estimation and three dimensional 
deprojections for the dark and luminous matter and t heir cross correlations. The 
results are obtained using a new optimal fast estimator l)PerJl2003l) . deprojected us- 
ing minimum variance and SVD techniques. We show the resulting 3-D power spec- 
tra for dark matter and galaxies, and their covariance for the VIRMOS-DESCART 
weak lensing shear and galaxy data. The survey is most sensitive to nonlinear scales 
&nl ~ 1/iMpc -1 . On these scales, our 3-D power spe ctrum of dark matter is in goo d 
agreement with the RCS 3-D power spectrum found li\ • I cumark fc Zaldarriaga (2002). 
Our galaxy power is similar to that found by the 2MASS survey, and larger than that 
of SDSS, APM and RCS, consistent with the expected difference in galaxy population. 

We find an average bias b = 1.24 ± 0.18 for the / selected galaxies, and a cross 
correlation coefficient r = 0.75 ± 0.23. Together with the power spectra, these results 
optimally encode the entire two point information about dark matter and galaxies, 
including galaxy-galaxy lensing. We address some of the implications regarding galaxy 
halos and mass-to-light ratios. The best fit "halo" parameter h = r/b = 0.57 ± 0.16, 
suggesting that dynamical masses estimated using galaxies systematically underesti- 
mate total mass. 

Ongoing surveys, such as the Canada-France-Hawaii- Telescope-Legacy survey will 
significantly improve on the dynamic range, and future photometric redshift catalogs 
will allow tomography along the same principles. 

Key words: Cosmology-theory-simulation-observation: gravitational lensing, dark 
matter, large scale structure 



1 INTRODUCTION 

The recent measurements of the cosmic microwave back- 
ground anisotropics have moved p hysical cosmology i nto a 
new era of precision measurements iSpergel et al . 2003). The 
cosmic microwave background perturbations can be cleanly 
computed from first principles, and have been measured with 
high accuracy. This allows a clean inference of conditions at 
the redshift of recombination, z ~ 1089. To complete the 
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picture, a separate measurement of the state of the universe 
at lower redshifts is required. The original WMAP results 
used the distribution of optical galaxies as a proxy for the 
distribution of total matter. The dominant uncertainty in 
such an ex ercise is the relation ship between galaxies and 
total mass JContaldi et alj|2003h . 

A direct measure of the dynamics of total mass is 
clearly desirable, as well as a quantitative measure of the 
relationship between total mass and visible matter. Sta- 
tistical weak gravitational lensing provides such a handle, 
and direct measurements are already providing accuracies 
on cosmological parame ters such as as comparable to indi- 
rect galaxy techniques feacon et al] 120021 : iRefregier et al] 
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The gravitational field from the inhomogeneity of the 
dark matter distribution disturbs the light from background 
galaxies and distorts the apparent images of galaxies. When 
this distortion is small, it is called the weak gravitational 
lensing. Since gravity acts equally on all particles, gravita- 
tional lcnsing is a complete probe of the total matter distri- 
bution. Gravity is dominated by dark matter, which does not 
involve complicated gas physics, which makes gravitational 
lensing relatively straightforward to calculate from ana lyti- 
cal models and N-body simulations J White fc HvJ2000h . All 
these advantages make weak gravitational lensing a powerful 
probe of cosmological parameters and the matter distribu- 
tion. 

So far all analyses of weak lensing data have been para- 
metric by comparing the observed two dimensional cor- 
relations of shear to the predictions of standard models. 
A direct optimal statistical analyses of the data sets has 
so far been beyo nd the scope of computational resources. 
IPen et all (l2002h presented the first angular power spec- 
tra obtained from inversions of the correlations functions. 
IBrown et afl tOOtf) did maximum likelihood estimation on 
a low signal-to-noise data set. 

In this paper, we extract the 3-D dark and lumi- 
nous matter power spectrum from the 2-D weak lensing 
measurements in the VIRMOS-DESCART survey. Sim- 
ilar works have been tested successfully in inverting the 
2-D galax y angular correlations to 3-D galaxy 
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suring the distribution of galaxies and dark matter from 
the same s urvey, we find a dir ect measure of the cross 
correlation iHoekstra et aljl2002h . 



2 DATA 

The VIRMOS-DESCART data consist of four uncorrelated 
patches (referred as fields F02, F10, F14 and F22 accord- 
ing to their RA position) of about 4 square-degrees each 
and separated by more than 40 degrees. The fields have 
been observed with the CFH12k panoramic CCD camera, 
mounted at the Canada-France-Hawaii Telescope prime fo- 
cus, over the periods between January 1999 and November 
2001. The observations and data reduction have been de- 
scrib ed in previous VIRMOS-D ESCART co smic shear pa- 
pers llVan Waerbeke et al.lEoOol. l200ll l2002h . 

The observations have been done with the I-band fil- 
ter available on the CFH12k camera a with typical ex- 
posure time of one hour. The final cosmic shear catalog 
contains 392,055 galaxies with magnitude Iab > 22 and 
median Iab =23.6. Several careful checks have demon- 
strated that systematic residual s are very small. How ever, 
IVan Waerbeke et al.1 j200lLl2002h and lPen et ail <2002h have 
shown that a B-mode signal still remains on scales larger 
than 10 arc-minutes. Its origin is not yet understood. 

The reds hift distribution was modele d by the procedure 
described in IVan Waerbeke et alJ <l200lT) using the photo- 
metric redshifts from the Hubble deep fields (HDF). Here 
we performed the same analyses for all the galaxies with 




Figure 1. The top histogram shows the source weighted redshift 
distribution of the faint Iab > 22 catalog modeled from the HDF. 
The bottom histogram shows the bright 20 < Iab < 21 galaxy 
redshift distribution of the catalog modeled from the CFRS (see 
section 0}, and the dotted curve is the fitting formula used in our 
analyses. 



magnitude Iab > 22. The resulting histogram for this sam- 
ple with their appropriate noise weights is shown in Figure^ 
It was modelled from the photometric reds hifts of the Hub- 
ble D eep Fields north and South (see lVan Waerbeke et all 
l)200 J) V In the absence of any spectroscopic survey deeper 
than Iab = 22 this is the best redshift estimate at the mo- 
ment. 

To compute the galaxy angular power spectra, the 
galaxy photometries were calibrated. A mask file was con- 
structed on a grid spaced 6.18 arc seconds. For the lens plane 
sample, we chose a magnitude range 20 < Iab < 21, which 
contains 20657 galaxies. 



3 2-D POWER SPECTRUM ESTIMATION 

Given a reduced galaxy catalog, the first stage in the infor- 
mation compression process is to produce a two dimensional 
angular power spectrum, which encodes all two points infor- 
mation. Each galaxy has two observables: a position angle 9 
and axis ratio e, from which one forms the two polarization 
components ei = ecos(2#) and ei — esin(2#). The data set 
contains close to a million degrees of freedom. A direct op- 
timal power estimation requires 0(N 3 ) operations, which is 
completely intractable computationally. One could try to bin 
the data on a coars e grid to reduce the computational cost 
JWhite fc Hul EoOO). Current processing power allows one 
to deploy about 50 grid cells on a side in such a treatment, 
corresponding to about 3 arc minute cells. Such a coarse 
binning unfortunately looses information. Karhunen-Loeve 
compression still involves an initially expensive eigenmode 
expansion, which is not tractible for a data set of this size. 
We applied a novel fast matrix solver, which reduces the 
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Table 1. Cosmological models used to test the inversion. 



probl em from Q (N 3 ) to 0(N log TV) while still being opti- 
mal. {Pen||2003). The galaxy catalog for each field is binned 
onto a square grid 1024 cells wide, and power is estimated 
directly on this grid. The power fo r the individual fiel ds are 
then combined by the procedure of IWang et alJ l|2002). The 
Fisher matrix, error bars, and window functions are monte- 
carloed using 1000 realizations. The angular width of the 
grid was chosen to be 3.01 degrees, which is a little larger 
than the largest dimension of any field. For the lowest wave 
numbers along the longest direction of the grid this intro- 
duces an artificial aliasing due to periodic boundary con- 
ditions. In principle this could be avoided with a two level 
grid as described in IPadmanabhan et alJ J2002) , but this 
has not yet been implemented on the multi-grid accelerated 
scheme. Since we do not use the lowest wave number bin 
in our analysis, this should not have a significant effect on 
our results. The analysis is based on a quadratic estimator 
with optimal weights relative to a model prior. We chose the 
parameters of the LCDM model in table^ Several strategic 
choices are made in the process. The dark matter and galaxy 
power spectrum were estimated independently. To estimate 
the cross correlation, both must be taken into account simul- 
taneously. When one allows for a prior correlation of dark 
matter with galaxies, the eigenmodes of the power are no 
longer dark matter and galaxy power, but linear sums and 
differences. 

The priors also gave equal E and B mode weights. The 
signal is known to have measurable B mode contamination 
JPen et al.ll2002f) . for whose separation a symmetric weight 
seemed most robust. Unequal priors lead to different window 
functions for the two modes, making comparisons tricky. A 
further potential complication is the additive whit e noise cal- 
ibratio n in the power spectrum. As described in lPen et alJ 
(2002), the intrinsic distribution of the unlensed ellipticity 
distribution can not be measured, so the two point shear 
correlation function at zero lag is completely unknown. This 
translates into an additive white noise factor in the power 
spectrum. Normally the noise is subtracted from the power 
spectrum estimator, and in our case, we estimate the noise 
by randomly rotating galaxies. This will cancel the correla- 
tion function at zero lag (which is invariant under rotations) , 
and determine a fixed value of the integration constant. At 
large I this can lead to a systematic underestimate of the 
power. 

The reduced catalogues also have a statistical weight 
for each galaxy, which in Gaussian analyses is equal to 
the inverse noise variance. A random rotation would as- 
sign a noise equal to the observed ellipticity, but such a 
procedure biases the estimated power spectrum. Instead, 
the noise was assigned usin g the table lookup described by 
IVan Waerbeke et alJ (I2000T) . Our power spectrum analysis 
thus has a different procedure for the noise and weights. 
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Figure 2. Spherical harmonic power spectrum. The dashed boxes 
indicate the raw E-type power-spectrum, while the crosses denote 
the B-type power. The solid boxes with error bars are E — B, 
which is the quantity we used to calculate the three dimensional 
power. Their error bars are the quadrature sum yJa E 2 + B 2 . We 
have shifted the solid boxes with error bars to larger 1 slightly, 
in order to be distinguished with E-type and B-type power. The 
dashed straight line is the noise. The solid curved line is the model 
prediction for LCDM (see table [Tt. 



Figure |21 is the power spectrum of cosmic shear, which 
represents the fluctuations in the projected surface density 
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E(n) is the surface density, n is the direction on the sky. 
Dd, Dds and D s are the angular-diameter distance between 
the observer and lens, lens and source, observer and source 
respectively. 

The dashed boxes indicate the E-type power-spectrum, 
while the crosses denote the B-type power. The solid boxes 
with error bars are the subtracted power E — B, which is the 
quantity we used to calculate the three dimensional power. 
For the difference powers, the error bars are calculated by 
the quadrature sum \/o"b 2 + B 2 . The B mode is taken as 
a diagnostic for the error estimate, and we only added the 
value to the diagonal of the covariance matrix. Correlations 
between scales are not accounted for. The dashed straight 
line is the noise, which dominates at small angular scales. 
When noise dominates, the inverse noise weighted two point 
correlation function is an optimal estimator. The solid curve 
line is the power spectrum projected by the Limber equa- 
tion from the thre e dimensional power using the code by 
ISmith et alJ (I2002T) . We note that the errors are dominated 
by the B mode, and eyeballing the plots, one could see up 
to three independent useful power spectrum measurements. 




Figure 3. The contribution to 2-D power spectrum by the 3-D 
power spectrum from different redshifts for a fixed angular scale 
I = 956. The dotted line is for the galaxies. The solid line is for 
the dark matter. 

To measure the distribution of luminous matter, we 
chose a magnitude range which traces the redshift distri- 
bution of lenses. The range 20 < Iab < 21 results in a 
differential redshift contribution shown in Figure [3J which 
is well matched to the lens weights. We used this magni- 
tude range to measure the galaxy power spectrum shown in 
Figure 2] 

In the two dimensional projection, we fitted for a 
parametrized power spectrum A 2 = (k/ko) n . This corre- 
sponds to a power law correlation function of the form 
£ = (r/r )- n , with r = 2 [tt/2 sin(n7r/2)r(2 - n)] 1/n /k . 
We used two values of n: 1.8 and 1.7. For n = 1.8, the best 
fit correlation length is fc = 0.22 ± 0.010/iMpc" 1 for the / 
selected galaxy population at weighted redshift z m = 0.36. 
The corresponding correlation length is ro = 4.79 ± 0.22/i _1 
Mpc. The shallower slope n = 1.7 fits the data slightly bet- 
ter, and results in a 17% longer correlation length. The two 
dimensional galaxy power certainly appears consistent with 
expectations for this population of galaxies. 

We used prior weights in the galaxy power spectrum 
estimation corresponding to the n — 1.7 dot-dashed line in 
Figure [I] The galaxy power spectrum estimation is accom- 
plished on a significantly masked geometry, which results in 
aliasing of modes. This is described by the window function, 
shown in Figure [H] 

The finite sampling of the 1000 Monte-Carlo simula- 
tions to compute the Fisher matrix results in an expected 
error of about 4% on the diagonals. On the off-diagonals, 
this error becomes large compared to the actual values after 
about the second distant bin, since the actual correlations 
become quite small. We only included the diagonal and the 
first off-diagonal of the Fisher matrix in all subsequent anal- 
yses. 

Computing the cross correlation we initially used a 
light-traces-mass prior. The quadratic estimator just weights 



Figure 4. The circles indicate the power spectrum for galaxies 
20 < Iab < 21. The dashed line is the Poisson noise, and the 
solid line is the Limber equation prediction for luminous matter 
perfectly tracing dark matter. The dot-dashed and dotted lines 
are the projection of the best fit power spectra parametrized as 
A 2 = (k/k ) n , for n = 1.7 and k = 0.16(/iMpc~ 1 ), as well as 
n = 1.8 and kg = 0.18(/iMpc — 1 ) respectively. 
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Figure 5. The window function for the two dimensional power 
spectrum of shear and galaxies. ^From left to right, the lines are 
for /=239, 478, 956, 1912, 3825, 7650, 15300 respectively. The 
windowed variable is 1(1 + l)C;/27r, so for flat band-power more 
power leaks from large scales to small than vice versa. 
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Figure 6. The cross correlation between dark matter and galax- 
ies, also called galaxy-galaxy lensing power. The circles indicate 
the power spectrum for galaxies 20 < Iab < 21. The solid line is 
the Limber projection for galaxies tracing dark matter perfectly. 
The dotted line is the projection of a power law galaxy correla- 
tion (see text) with n = 1.8, which has no stochasticity relative 
to dark matter. 



the shear and galaxy surface density individually by a 
Wiener filter. For a perfectly correlated galaxy and dark 
matter field, the expected cross correlation coefficient is 0.96, 
indicating that the lensing weights and galaxy weights over- 
lap very strongly. 

The resulting power spectrum is shown in Figure |H] It 
is apparent that the cross correlation is systematically lower 
than expected for a perfectly non-stochastic galaxy distri- 
bution (dotted line). 



4 DEPROJECTION 

The projection of a three dimensional power spectrum 



(3) 



to a two dimensional angular power spectrum 1(1 + l)Ci/2n 
is given by Limber's equation llHutererll2002l) . 



fc^Q = ^ " A>(l/ x (z),zMzf x (z)^dz. 

The comoving angular diameter distance is 
dz 



o H(z) 

where H(z) is the Hubble constant at redshift z: 
H(z) = H [(l + z) 2 (Q m z + 1) - fi A z(z + 2)] 1/2 . 



(4) 
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For the angular diameter distance \ we used the fitting for- 
mula from IPenl jl999lh The weight function w(z) is defined 



below for the different categories of dark and luminous mat- 
ter. For broad weight functions, Limber's equation Q mixes 
modes significantly. A direct deprojection is analogous to 
a deconvolution, which is in general numerically unstable. 
Several alternative exist to recover the underlying three di- 
mensional power spectrum. 

All measurements of power spectra, even with full three 
dimensional information, measure the power spectrum con- 
volved with a window function. One can think of the two 
dimensional measurement an extreme case of such a win- 
dow function. Each angular wave number I is a sum over 
power spectra at different linear wave numbers k. This sum 
can be treated analogous to a window function. The current 
fash ion is not to a ttempt to deconvolve power spectra. 

ISeliakl lll998bD proposed such a minimum variance pro- 
cedure, which estimates power in broad bands, but with 
small errors. This approach leads to estimates of power over 
broad windows, and is quite robust. There is a slight model 
dependency in the normalization of the window functions, 
for which we calibrate relative to a fiducial model described 
below. As long as the power spectrum has a similar shape 
to the underlying model, the total amplitude is unbiased. If 
the bins are chosen narrower than the windows, they just 
become more and more correlated, but error bars remain 
constant. We call the minimum variance procedure Method 
A. If one considers Limber's equation to be a linear map- 
ping L, the minimum variance procedure weights each data 
point by the inverse noise variance, and projects it back us- 
ing the transpose of L. This gives a unique parameter-free 
lossless mapping, and a natural way of deprojecting the an- 
gular wave numbers / back to linear wave numbers k. 

This minimum variance deprojection is an even more 
convolved version of the original power spectrum. One can 
now attempt to deconvolve this deprojection to narrow down 
the window function. As in the past literature, we stabilize 
this deconvolution using a singular value decomposition. An 
unbounded inversion has window functions that are delta 
functions, and does not need a model to normalize. In prac- 
tice, one has to introduce a cutoff, which reduces power and 
will always bias the answer low. In this case, one can recal- 
ibrate relative to a model. The number of modes that are 
chosen is another new free parameter. Since significant ef- 
fort had been spent on this procedure historically, we include 
such an analysis for comparison. 

We now proceed to describe the specific deprojec- 
tion for the VIRMOS-DESCART angular power spectra. 
We modelled the non-linear power spectrum A 2 using the 
ISmith et alJ J2002T) code. S7 m and S1a denote the matter den- 
sity and cosmological constant density today. Whenever a 
quantity is redshift dependent, we explicitly include that, 
for example Q m {z) and 0,a(z) are the values at redshift z. 

For dark matter the lensing weight is 



3 2 
w dm {z) = -Q. m H g{z)(l + z) 

where 

9( Z ) = x( Z )£ X dz'n s (z')^l^. 



(7) 



(8) 



n s (z) is the distribution of source galaxies, for which we use 
the statistically sampled model from the HDF shown in Fig. 

m 
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For galaxies, 

galaxy(z) = H(z)n g (z) 



(9) 



where n g (z) is the lens plane galaxies distribution. We used 
a galaxy distribution parametrized as 



n g (z) 



(3 



z s r 



W(£f exp -(£) 



(10) 



with fixed values a = 2 and /3 = 1.5. z s — 0.340. We 
modelled z s = 0.340 from the CFRS catalogue llLillv et alJ 
1995: ILe Fevre e^: al.||l99!ft [ Lilly et, alJ ll995: Ha mmer et alJ 
Il995l: ICrampton et alJll995ft . We used the redshift distri- 
bution of galaxies with 20 < Iab < 21, discarding objects 
of class greater than 9 or less than 2, for which redshift 
determinations could have been problematic. This left us 
with 140 galaxies, with median redshift zm = 0.480. The 
uncertainty of zm is deduced by bootstrapping the catalog 
1000 times, resulting in a bootstrap error Azm = 0.021. 
The bootstrap error may well underestimate the true error, 
so we applied the same estimators to each of the five fields 
separately. The resulti ng median redshifts a nd bootstrap er- 
rors are: 000 0-00 field <Le Feyre et alJll995l) : 0.195 ± 0.083, 
0300 -00 fieldJHammer et alJll995T) : (1543 ± 0.039, 1000+25 
fieldllLe Fevre et al l ll995F ~ 0.495 ± 0.031, 1415+52 field 
jLillv et al.lll995l) : 0.519 ± 0.094, and the 2215+00 field 
(ibid): 0.341+0.028 respectly. Combining the results of those 
patches weighted by their inverse variances, we find a mean 
median redshift 0.431 ± 0.018, which is marginally consis- 
tent with the bootstrap error. Taking the difference squared 
between the sample median and each field median, dividing 
by the sum of the bootstrap variances, we find a x 2 — 5.8 
per degree of freedom, indicating that the bootstrap errors 
are probably an underestimate, and that the true error is 
2.4 times larger if taken at face value. Unfortunately, the 
standard deviation of this variance estimator from 5 fields is 
y/2/(n-l) = 71%, so the true error is very poorly known. 

In terms of the parameterization 1101 . we have z s — 
0.340. For the cross correlation between galaxies and dark 
matter, we changed the w(z) 2 in equation @ 1° the product 
of dark matter weight function and galaxy weight function 

Wdm,(z)w ga laxy(z). 

We assumed that the three dimensional power spectrum 
evolves linearly with redshift relative to a reference redshift 



A 2 {k,z) = A 2 (k,z„ 



D{z) 

D(Zm) 



(11) 



The errors introduced by this simplification are quantified 
below. We chose z m to be the redshift below which half 
the power originates at the middle of our angular scales of 
interest, / ~ 1000. Writing the contributions in terms of an 
integrand I(z), 



l(l + l)C t 
2tt 



I{z)dz 



(12) 



we find from Figure [3] that a value z m = 0.36 is close to the 
median contribution for both the dark matter and galaxy 
distribution. 

D(z) is th e linear growth fact or, for which we use the 
fitting formula JCarroll et alJll992h : 



D(z) = 



Di(z) 



Di(z) 



l+z£>i(0)' 



(13) 



-[(Q m (z)) 4/r - fi A (z) 



+(1 + n m (z)/2)(l + fi A (z)/70)]- 



(14) 

We group the two-dimensional power spectrum 1(1 + 
l)Ca/27r into an ni dimensional vector y and the three- 
dimensional power spectrum A 2 (k,z) an rife dimensional 
vector x. Discretizing the integral m t° a trapezoidal rule 
sum of 15 redshift slices with Az = .12, Limber's equation 
can be written as JWang et al]l2002ft 



y = Cx + n. 



(15) 



In our analysis, the projection matrix C also contains the 
effects of the window arising from the power spectrum esti- 
mation on the irregular grid. To relate the linear analysis of 
power spectra to that of Gaussian random fields, we intro- 
duced a random noise vector n, whose covariance is defined 
to be the Fisher matrix: 



(yy t >-(y>(y t >=F i3 - = (nn t ). 



(16) 



We used a bilinear interpolation in log k to evaluate the 
power spectrum in the integrand. 

An linear estimate of x can be written as: 

x = Py. (17) 

When all equations are invertible, one could formally use 

p = cr 1 . (is) 

In general, however, ni 7^ nt, and the matrix C is 
not square, not invertible, and even if it were square, 
is very ill conditioned. Then the choice of P becomes 
impor t an t. We will discuss three pr o cedur es jTsgrnark 
I l997allbt iTeemark fc de Oliveira-Costal l200ll : I Wang et al. 
2002): Method A, which gives minimal but correlated error 
bars JSeliaJdll9983bh is: 



Pi = NiC*F" 



(19) 



A diagonal normalization matrix Ni is defined relative to a 
fiducial input model power spectrum y such that y = Piy. 
In general, one can measure power spectra and cross power 
spectra as a function of source redshift, so for each I there 
could be multiple measurements of Ci with appropriate co- 
variances F. Due to the limited signal-to-noise and absence 
of detailed source redshift information in our survey, we only 
used one combined power spectrum. The procedure is gen- 
eral, and can combine any number of source and cross powers 
optimally. 

Method C is mathematically equivalent to equation 

fTHI : 

P 3 = N3(C t F _1 C) _1 C f F _1 (20) 

if C'F C is invertible. The normalization N3 is defined as 
for 11911 . If the central term is invertible, the normalization 
N3 is the identity matrix. We will use SVD to stabilize the 
problem for the general case. 

The basis of SVD comes from the following linear alge- 
bra result: Any mxn matrix F (m ^ n) can be decomposed 
to an m x n column-orthogonal matrix U, an n x n diag- 
onal matrix W with positive or zero elements(the singular 
values), and the transpose of an n x n orthogonal matrix V. 
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F = UWV\ 

The matrices U and V satisfy: 
U*U = V*V* = 1. 



(21) 



(22) 



dark matter 



If the matrix F is square,then U, V, and W are all 
n x n square matrices. Then the inverse of F is 



F _1 = V[diag(l/^)]U* 



(23) 



But if one of the Wj is zero, or (numerically) so small that its 
value is dominated by roundoff error, the inverse process will 
be incorrect. SVD prescribes the inverse of these "singular 
values" to be se t to zero. We us e the routines from Nu- 
merical Recipes jPress et alj [l992') to implement this SVD 
decomposition. 

In order to solve our problem: 



Cx = y, 



(24) 



we first consider the following sets of linear equations: 

Fx = b, (25) 

and try to invert such a equation for a square n x n matrix 
F, and vectors x and b. The first question is whether b lies 
in the range of F or not. If it does, then the set of equations 
does have one more more solutions which may be degenerate. 
If b does not lie in the range, then there is no solution. In 
both cases, replace those 1/tVj by zero if Wj — or utj is 
very small. We quantify smallness below. We then calculate 



x = V[diag(l/iWj)]U*6. 



(26) 



Here V, W, and u' are decomposed by F. This is the solu- 
tion given by SVD. 

In the case of fewer equations than unknowns(m < n), 
the method is also applicable, however there will be an n — m 
dimensional family of solutions. We have to choose a param- 
eter for the threshold to zero those small Wj . Different cutoffs 
may lead to different solutions. To find a suitable criteria for 
the cutoff, we can calibrate with simulation data. 

Finally, an intermediate choice between methods A and 
C is method B: 



N 2 (C t F _1 C) _ 2C t F _ 



(27) 



One might expect that an SVD cutoff is not needed for this 
case, since any large eigenvalues on the square root are can- 
celed by small eigenvalues of the last term. 



5 RESULTS 

5.1 dark matter power spectrum 

The deprojected three-dimensional power spectrum x from 
the observed angular power is given by a linear relation x = 
Pyob, with covariance 



(xx*) - <x)(x') = P'FP. 



(28) 



y b denotes the observed data, written as a ni dimensional 
vector. We used n; = 7 starting at 1=236 with logarithmic 
bins each a factor of two wide. In the three dimensional space 
we chose wave numbers corresponding to k — l/x( z m), and 
added two more bins on each side for a total of = 11. 

The minimum variance solution for Method A is shown 
m Figd Only the seven wave numbers in the central range 
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Figure 7. The 3-D power spectrum of dark matter linearly 
evolved to redshift zero, for the minimum variance method A. 
The dott ed lines are the inp ut power spectrum, linearly evolved 
from the ISmith et alJ i2002fl power spectrum at z m . From bot- 
tom to top are cosmologies SCDM1, SCDM2 and LCDM (see 
table The solid lines are the deprojected 3-D power spectrum 
from the non-linearly projected 2D power spectrum. The boxes 
are the power spectrum deprojected from the measured angular 
power spectrum. The deprojected lines of SCDM here were scaled 
with by (SJscdm/^lcdm) 1,2 to compensate the linear evolution 
difference. 



are plotted. Error bars are the square roots of the diagonal 
elements of the matrix 



E = P'FP. 



(29) 



The off-diagonal terms of the covariance matrix can be 
plotted in terms of cross-correlation coefficients, 



E, 



(30) 



In Figure [S] we show the cross correlation coefficient for the 
seven central solutions of Figure [7] The points are clearly 
significantly correlated. 

To simplify the inversion process we assumed linear evo- 
lution of the power amplitude with redshift. We only spec- 
ified the 3-D power spectrum at one independent redshift 
z m . The inversions are all normalized relative to the prior 
LCDM cosmology with linear evolution, for which by con- 
struction the resulting 3-D power spectrum will be the in- 
put power. If we apply the linear process to the non-linearly 
projected angular power spe ctrum, the results could differ. 
Using the [ Smith et alJ <l2002t) code we can also generate the 
full non-linear power spectrum at each redshift, from which 
we project the two dimensional angular power. In Figure [7| 
we plotted the input power spectrum at z m as the topmost 
dotted line, and the linearly recovered power spectrum as 
the corresponding solid line. We find good agreement, cer- 
tainly better than the other sources of statistical error. 
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Figure 8. The cross correlation coefficient betweens bins of Fig- 
urc EI iFrom left to right, the covariances are relative to the bin at 
k = 0.24,0.48,0.96, 1.92,3.84,7.68, 15.36(/iMpc- 1 ) respectively. 



Figure 9. The normalized window functions of dark matter for 
method A. ^From bottom to top are for the k bins in figure El For 
clarity, the lines have been displaced down 0.6, 0.4, 0.2, 0, -0.2, 
-0.4 and -0.6 units respectively. 



We need to know the stability to a change of cosmologi- 
cal parameter priors. To check cosmologies that are quite dif- 
ferent from the previously assumed LCDM model, we have 
also checked the SCDM cosmologies listed in table 1. In 
FietTl the two solid lines, from bottom up, are inverted solu- 
tions for SCDM1 and SCDM2 respectively. The dotted lines 
near them are the inputted 3-D power spectrum for those 
two models respectively. For SCDM models, the inverted 
solutions are still reasonably close to the original solution, 
which can be seen clearly from the figures. It shows that our 
inversion procedure is robust, and only mildly dependent on 
the shape of the power spectrum or non-linear evolution. 

As described above, the minimum variance power spec- 
trum estimation from Method A results in significant 
smoothing of the input power. This is described by a window 
function, which is shown in Figure [5] For our linear proce- 
dures, each bandpower estimator depends not only on the 
power in its own band, but due to geometry also on aliased 
power from other bands. The response of the estimator is 
just the window function. We see the increased breadth of 
each window compared to the two dimensional window, and 
there really are only a smaller number of independent points. 
Due to the low signal-to-noise of our data, we used wide log- 
arithmic bins, and the window functions are also averaged 
over the same bin sizes. 

The goal of methods B and C are to reduce the width 
of these windows by deconvolving the solution by the win- 
dow. If all steps were non-singular, method C would result 
in 5-function windows. For the SVD procedure, one chooses 
the number of eigenvalues (also called singular values) to 
include. The result will depend on the number of modes in- 
cluded. In Figure [TOl we show the recovered power spectrum 
as we increase the number of modes from bottom to top. 
The corresponding window functions are shown in Figure 
1111 For method C (right panel, Figure ITU1 . we see that in- 



creasing the number of modes results in increasing errors, as 
one expects from deconvolutions. For method B (left panel, 
same figure), the results are quite robust, and no cutoff is 
actually needed. The window functions in Figure 1111 show 
the effect of the SVD cutoff graphically. The bottom panel 
shows that if only one mode is used, all solutions are de- 
generate. As one increases the number of modes used, the 
windows shift apart. Method B remains stable, and results 
in windows that are narrower than Method A (shown in 
Figure [§J. For method C, the windows become ill-behaved, 
and one does not obtain a good window structure for any 
number of modes. 

The covariance of bins shows the linear structure of the 
solutions. As before, we can define the cross-correlation co- 
efficients between bins. The cross-correlation coefficients of 
Method A are shown in figure |H1 For Method B and C, if 
only one singular value is used, all modes are linearly de- 
pendent, so the cross- correlation coefficient is unity as can 
be seen in figure VTH Method B decorrelates the bins, mak- 
ing the statistical interpretation of the results particularly 
simple. The very high correlation between points in Method 
C shows that one never really recovers more than 2 inde- 
pendent modes regardless what cutoff one chooses. The full 
deconvolution does not lead to meaningful results. 

The top left panel in Figure tTUl is most readily compared 
to the literature. It is deconvolved but still stable. We see 
the three bins centered at k — 0.96 which have good signal 
to noise. The covariance matrix in Figure 1121 tells us that 
the three points are statistically uncorrelated. The top left 
panel in the window function Figure tTTl has a width of about 
two bins. While one might wonder how the bins can over- 
lap spatially and still be uncorrelated, this is not a contra- 
diction: the 2D projected power spectrum sums over many 
independent 3D modes. In the deprojection, each bin can 
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Figure 10. 3-D power spectrum of dark matter at rcdshift zero. 
The left panels are for method B, and the right panels are for 
method C. The dotte d lines are the inp ut power spectrum, lin- 
early evolved from the lSmith et alJ 120021) power spectrum at z m ■ 
The solid lines are the deprojected 3-D power spectrum from the 
non-linearly projected 2-D power spectrum. The boxes are the 
power spectrum deprojected from the measured angular power 
spectrum. The panels from bottom to top include 1, 3, 5, 7 SVD 
values in their reconstruction, respectively. 
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Figure 11. The normalized window function of dark matter for 
different SVD cutoffs. The panels are positioned as in Figure [Tfjl 
displaced down 0.9, 0.6, 0.3, 0, -0.3, -0.6, -0.9 units respectively. 
At the lowest panel, only one SVD is used, so the solution only 
has one degree of freedom and all reconstructed values are linearly 
degenerate. 



Figure 12. cross-correlation coefficient for the hierarchy of solu- 
tions. The left panels are for method B, and the right panels are 
for method C. The panels from bottom to top include 1, 3, 5, 7 
SVD values in their reconstruction, respectively. The successive 
lines have been displaced down 1.2, 0.8, 0.4, 0, -0.4, -0.8, -1.2 units 
respectively. 

still depend on different modes of the same absolute wave 
number. This gives as a series of statistically independent 
estimators of powers which probe the same physical length 

scales. 

iTegmark fc Zaldarriagal l)2002f) plotted a 3D estimate 
for the linear dark matt er power as probed b y RCS. In 
the Hamilton paradigm l|Hamilton et al.lll99lh . the non- 
linear structure on a given scale comes from the gravita- 
tional collapse of a larger scale. In an isotropic collapse, 
one expects the non-linear wavenumber to be given by the 
cube root of the density times the linear wave number, 
&NL = (1 + A 2 (/cnl)) 1 fc- Our three best points at /cnl = 
0.48,0.96, 1.92/iMpc" 1 map to k = 0.29, 0.37, 0.53ftMpc _1 , 
which is comparable the converted linear length scales mea- 
sured by the RCS data. We then used the lPeacock fc Doddsl 
( 1996) prescription to map the non-linear power to a linear 
power. The mapping was done relative to the fiducial LCDM 
model. 

The resulting combined CMB and weak lensing data 
is shown in Figure 1131 Our horizontal error bars are de- 
rived from the half width of the window function (top left 
pan el of Figure The er r or bar s have been rescaled to 
the ITegmark fc Zaldarriagal l|2002F) convention of 20% to 
80% using a Gaussian model. The CMB data, courtesy 
of Max Tegmark, includes all the experiments compiled in 
ITegmark fc Zaldarriagal <|2002ri as well as the recent WMAP 
data (Tegmark, private communication). We see good agree- 
ment between two completely different lensing data sets 
(RCS and VIRMOS-DESCART), as well as a good fit to 
the standard cosmological model. The residual differences 
could well arise from the subtleties in modelling the PD96 
prescri ption, since we see a better fit to the same model us- 
ing the lSmith et alJ ll2002tl algorithm in the non-linear power 
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Figure 13. Linearized CMB and RCS comparison power spectrum from Tcgmark fc Zaldarriagal J2002t) . The open boxes are the 
VIRMOS-DESCART power. They are mutually uncorrelated. The solid boxes are a combination of CMB powers. The crosses are RCS 
points. The lensing data sets are linearly decompressed and evolved to z = (see text for details). The solid line is the linear LCDM 
model. 



shown in the top left panel of Figure ITUl Unfortunately the 
newer, more accurate non- linear formulae are difficult to in- 
vert from non-linear to linear power heuristically. 

The CMB data can be predicted from first principles 
to exquisite accuracy. The weak lensing is similarly pre- 
dictable from first principles, and only limited by the ac- 
curacy of simulations. These are currently not a limiting 
step, but do need to improve to match newer lensing data 
sets. Both CMB and weak lensing data sets are observation- 
ally challenging to obtain, but theoretically very clean to 
interpret and are unlikely to contain astronomical uncertain- 
ties. We expect ongoing surveys such as the CFHT legacy 
survey (http://www.cfht.hawaii.edu/Science/CFHLS/) to 
bring the measurement of non-linear power to a precision 
era, for which we can then perform precision cosmology 
without invoking complex poorly understood radiative phe- 
nomena. 



5.2 galaxy and cross power spectra 

The galaxy and cross power spectra are obtained completely 
analogously. The results for Method A are shown in Fig- 
ures 1141 and 1151 and corresponding window functions are 
Figures 1161 and 1171 We compare our results to that ob- 
tained by invertin g angul ar power spectra from 2M ASS 
jMaller et al.ll2003l). APM feisenstein fc Zaldarriagal l200ll) 
and SPSS JPodelson et alJl2002ft. To compare w ith APM, 
we used table 4 of lEisenstein fc Zaldarriagal teOOll) and used 
their (Cp 1 ) 1 ^ 2 as error bars. In order to compare with our 
result, the power of APM has been linearly evolved from 
z = 0.11 to zero in figure IT4l which is analog ous to our anal- 
ysis. F or SDSS, we use the data in table 2 of lDodelson et aD 
(2002). We used the error bar not including redshift errors, 
and linearly evolved every bin to redshift ze ro. The median 
weight redshifts were taken from table 1 of lDodelson et aD 
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Figure 14. The 3-D power spectrum o f galaxies at redshi ft zero, for the minimum variance method A. The dotted line is the input 
power spectrum, linearly scaled from the lSmith et alJ i2002h power spectrum at z m = 0.36. The solid line is the deprojected 3-D power 
spectrum from the nonlinear projected 2-D power spectrum. The crosses are the power spectrum deprojected from the measured angular 
power spectrum. The shaded region is covering the deprojected 3-D power spectrum when the inpu t redshift z a varies over the bootstrap 
uncertainty from .325 to 0.355. Boxes are the power s pectrum from APM lEisenstein fc Zaldarriag3l200ll) . small circles from 2MASS 
(Mailer ct al. 2003), and large solid circles from SDSS toodelson et alJl2002h . The dot-dashed line is the parametrized power spectrum 
(fc/(0.18/iMpc -1 )) 1,8 . The dashed line is the projection/deprojection of this power law power spectrum. 



( 2002) given by the SDSS photo-z's. We combined the results 
of the 4 magnitudes bins weighted by the inverse variances. 

The VIRMOS-DESCART galaxies clearly show more 
power than the SDSS or APM data set. For reference, Figure 
1 141 also shows the dark matter power (solid line) and the best 
fit n = 1.8 power law to the angular power spectrum (dot- 
dashed line). In the inversion, we used the normalization 
coefficients N of the dark matter, which can introduce a 
bias for a power spectrum of different slope. The result of 
projecting and deprojecting the power law is shown as the 
long dashed curve, which is in general agreement with the 
input power. 

To put the result in perspective, the best fit correlation 
length (described in section^ is r = 4.79 ± 0.22(/i _1 Mpc) 



at median redshift z m — 0.36 for fixed power law index 
n — 1.8 . This is similar to correlation lengths found in 
CNOC2 dShepherd et all200lft . where the correlation length 
varied from r<j ~ 3.4 — 5.5/i _1 Mpc depending on galaxy 
population and redshift. The different power in APM and 
SDSS might just be a reflection of different galaxy types. 
The 2MASS galaxies, which are also infrared selected like 
the VIRMOS-DESCART, are in better agreement. We also 
note that overestimating galaxy distances overestimates the 
inferred power, and the nominal bootstrap uncertainty is 
reflected in the hashed region around the solid line in Fig- 
ure 1141 If we estimate the SDSS correlation length to be 
0.3/iMpc -3 linearly evolved at z — 0, we infer a correlation 
length ro = 3fo _1 Mpc, which is at the low range for the 
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Figure 15. The 3-D power spectrum of the cross-correlation 
of galaxy and dark matter at redshift zero, for method A. The 
box is the power spectrum deprojected from the angular power 
spectrum. The dotted l ine is the or i ginal power spectrum, lin- 
early evolved from the ISmith ct al. (2002) power spectrum at 
Zm = 0.36. The solid line is the deprojected 3-D power spectrum 
from the non-linearly projected 2-D power spectrum. The dot- 
dashed is the cross power for non-stochastic galaxies where the 
bias is adjusted to give a power law correlation, and the dashed 
line is the corresponding deprojection. 
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Figure 16. The window function of galaxies, for method A. 
i,From bottom to top arc for k = 0.24, k = 0.48, k = 0.96, 
k = 1.92, k = 3.84, k = 7.68, k = 15.36(/iMpc" 1 ), which have 
been moved down 0.6, 0.4, 0.2, 0, -0.2, -0.4, -0.6 units respectively. 



Figure 17. Window function of the cross-correlation of galaxy 
and dark matter, for method A. The meaning of the lines and 
signs are the same as Fig. 1161 



CNOC2 sample. We should note that the overall redshift 
calibration could be off by more than the bootstrap error. 
The redshift distribution of galaxies is also broad, and we 
assumed a linear evolution model for the clustering, which 
might not be what the galaxies actually do. In principle this 
evolution can be measured from the data itself by modelling 
the galaxies at difference magnitude cutoffs, which is the 
subject of a future paper. In any case, this comparison puts 
forwards the central role of redshift information for a correct 
cosmological interpretation of the data 

The cross power spectrum from LCDM (table is 
shown in figure [T51 as the dot-dashed line. The dashed line 
near it is the deprojected power spectrum from the non- 
linearly projection of the new cross power. The two lines are 
still close to each other, which means the inversion process 
for the cross power spectrum is also robust. 

Just as for dark matter, each of the bins has correla- 
tions. Qualitatively, they are similarly behaved to that of 
the dark matter. We only show the cross-correlation coeffi- 
cient for method A in figure [T%1 

With the full set of deprojected three dimensional power 
spectra of galaxies A? al , dark matter A^ m , and their cross- 
correlation A^ ross , we can directly m easure the derived quan- 
tities "bias" b and "stochasticity" r iPenlll99st) . The bias is: 

b =JW (3i) 

y dm 

which is shown in Fig lliJI The upper error bars are obtained 
by using the 1 — a upper value for A^ al divided by the 1 — a 
lower value of A^ m , and analogously for the lower error bar. 
From the definition, bias is dependent on the cosmology. 
The galaxy-dark matter cross-correlation coefficient is also 
directly measurable, 
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Figure 18. cross-correlation coefficient using method A for the 
galaxy and the galaxy-dark matter cross correlation. The top 
panel is for galaxy, and the bottom panel is for galaxy-dark mat- 
ter cross correlation. The meaning of lines are the same as Fig. 

El 



(32) 



dm gal 

The dark matter power has the largest error bar. One can 
take the ratio of cross and galaxy power, which has smaller 
errors. We call this the "galaxy halo parameter" h, denned 



(33) 



as 

r A 2 

7 ' "cross 

b ~ ~A\' 

gal 

It is shown in the bottom panel of Fig. 1191 The error bars 
are drawn using the procedure described above. This halo 
parameter is also dependent on cosmology, but the galaxy- 
dark matter cross-correlation coefficient is independent. 

We fitted a constant average value for b from the mini- 
mum in x 2 f° r Method B: 

A^oo-zMLtfe)] 2 



JV 

2 1 \ - 

X = T7 



(34) 



sampled at six wavenumbers ki =0.24, 0.48, 0.96, 1.92, 3.84, 
7.68/iMpc~ 1 . Since the covariances are neglible, we neglected 
them. The variance of b is taken from A^ 2 , 



V2 1 dV 



2 dW- 



(Sbf 



(35) 



We apply the same procedure to solve for r and h. For- 
mally, we find b = 1.24 ± 0.18, r = 0.75 ± 0.23, and 
h = r/b = 0.57T0.16 with X 2 = 1-20, 0.51, 0.86 per d.o.f. for 
six degrees of freedom, consistent with the expected stan- 
dard deviation of 0.6. Just as the bias measurements are 
subject to systematic redshift calibration uncertainties, the 
cross-correlation results could also depend on such issues. 
Future surveys with photometric redshifts signficantly re- 
duce this problem. 



Figure 19. The top panels are bias of galaxies versus dark mat- 
ter, for method A and B from left to right. The middle panels 
are cross-correlation coefficients of galaxy and dark matter, for 
method A on the left, and method B on the right. The bottom 
panels are the "galaxy halo parameter". The dotted lines in the 
right panels are best-fit values and the shaded region show the for- 
mal uncertainty: b = 1.24 ±0.18, r = 0.75±0.23, h = 0.57±0.16. 
In our linear evolution model, the results are redshift indepen- 
dent. 



The resulting bias b is larger tha n that found from com - 
paring VIRMOS-DESCART to RCS (iHoekstra et alJl2002ll . 
That study found b = 0.Tl±g g| and r = 0.57+^. The 
cross correlation coefficient r is consistent. The apparent dis- 
crepancy in the bias b should not be overinterpreted. This 
earlier comparison was in 2-dimensional projection. Further- 
more, RCS galaxies are red R selected, which can be a dif- 
ferent population. Within the SDSS galaxies on our scales of 
k ~ lftMpc -1 the power for galaxies varies by a factor of 4 
between the 18-19 and the 21-22 magn itude bin. Similarly, 
the 2MASS power llMaller et al.ll2003l) is closer to our de- 
rived value than the R selected SDSS galaxies. One clearly 
needs to exercise care when converting the parameter fits 
from one sample of galaxies to another one, as galaxies se- 
lected in different colours will have quite different clustering 
properties. 

Our bias, cross-correlation and halo parameters were 
all estimated in 3-D space. One could have also attempted 
a parametric estimation on the 2-D projected power. For an 
optimal inversion process if one takes the covariances be- 
tween scales into account, the results will be the same. The 
single biggest source of larger error is that we added the B 
mode power to the error budget. Our model of sample vari- 
ance in the optimal 2-D power will also give a larger sample 
variance error. The reason for that is the under-estimate of 
error on the 2-pt correlation function. The previous sample 
variance errors were estimated using an effective contigu- 
ous area. The masks and source clustering will increase the 
sample variance, since the same area is now non-uniformly 
sampled. In Figure [5] one sees that at I up to 1000, sample 
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variance makes up half the error budget (the dashed line is 
the noise, which exceeds the signal at I ~ 1000). At the end 
of the day we do have about twice the error bar, coming 
from this combination of factors. 

We had checked the effects of redshift binning, and at 
15 there was about a percent change compared to an infinite 
number of slices. The redshift evolution is parametrized, so 
the finer the redshift bins are the more accurate it gets. In 
the linear evolution model, the redshifts scale the same for 
the galaxies and dark matter, so that cancels exactly and is 
redshift invariant. 

One expects galaxies and dark matter to be well cor- 
related on linear scales when A 2 < 1 for both galaxies and 
dark matter, which is not well probed by the angular scales 
of the current data. Newer larger surveys should significantly 
improve on the angular scale coverage. When the two fields 
are well correlated, there is no sample variance in the mea- 
surements of r and b, which is reflected in the full joint 
estimation Fisher matrix. Using the current data, however, 
the errors in large scales are dominated by a B mode, which 
is not easily modelled. 



6 COMPARISON WITH THEORY 

In this section we will discuss the results of the study in the 
context of theories and other surveys. Measuring the relation 
between the distribution of light and that of dark matter has 
significant cosmological consequences, as discussed in the 
introduction. In the other direction, the theory of galaxy 
formation requires observational constraints to be tested. 
While physical cosmology originated thirty years ago in the 
paradigm that stars account for all the mass in the universe, 
today's picture is very different. The universe appears dom- 
inated by very mysterious dark energy accounting for about 
70% of the energy density of the universe. The second most 
important energetic contribution is dark matter, account- 
ing for another 27%. Ordinary baryonic matter accounts for 
another 3%. The visible stars account for less than 0.3%. 
Optical power spectra of galaxies measure the distribution 
of this 0.3% of matter, which may or may not be a good 
tracer of the hundredfold more abundant dark matter. The 
challenge to galaxy formation models is to understand the 
distribution and kinematics of that small fraction of visible 
stars. 

Different galaxies are composed of different stellar pop- 
ulations. Galaxies of different types cluster differently. Em- 
pirically it is known that red (early) type galaxies cluster 
more strongly than blue (late) type galaxies. The goal of 
the theory of galaxy formation is to quantify the distribu- 
tion of visible galaxies, i.e. the distribution of visible light. 
This distribution is quantified by various statistical proper- 
ties. At the two point level, theories of galaxy formation can 
be tested by predicting the two point statistics measured in 
this paper: auto and cross correlations. This correlation is a 
function of colour, morphology and redshift. 

Semi-analytic studies generically predict 
iSomerville et al1l200lf) earlier type (red elliptical) galaxies 
to be more strongly biased than late type (blue spiral) 
galaxies. Qualitatively, one might expect the VIRMOS- 
DESCART / band selected galaxies to be systematically 
redder than SDSS or RCS galaxies, and therefore more 



clustered (i.e. more biased). Here we should keep in mind 
that the restframe colours at our median weighted redshift 
z m ~ 0.36 are significantly bluer, so RCS/SDSS R bands 
are closer to restframe V, while VIRMOS-DESCART 
I shifts into the rest frame R. Comparison with APM 
or 2MASS is furthermore complicated by the significant 
difference in redshift distributions: these latter two surveys 
are much shallower with median redshifts of 0.11 and 0.07 
respectively. In our comparison plot shown in Figure 1141 
the different redshifts were scaled using a linear evolution 
model. Linear evolution assumes that clustering increases 
due to gravitationally induced motions. In biasing models, 
the clustering is enhanced by creating the objects in a more 
clustered fashion, such that the gravitational motions have 
a smaller fractional effect. A biased population is expected 
to evolve more slowly. The 2MASS galaxies actually have 
significantly more power than our VIRMOS-DESCART 
sample. Should the picture hold that the redder s urveys 
select for earlier type galaxies, ISomerville et all d200lf) 
predict an increasing bias as one goes from blue to red, 
which is from APM to RCS/SDSS to VIRMOS-DESCART 
to 2MASS. The stochasticity as parametrized by the cross 
correlation coefficient r was predicted to be less population 
dependent, which is what we observe. 

These qualitative statements are not easy to quantify. 
One would need to have a common measure of galaxy mor- 
phological classification into early and late types, and correct 
for evolutionary effects. Empirically, a careful study of the 
CNOC2 survey JShepherd et al.l200l|) showed the strong de- 
pendence of the clustering amplitude on the galaxy types. 
The relative evolution of each population was rapid, and the 
earl y types were much mor e clustered than the late types. 

iKochanek et alJ <l2000l) did a morphological breakdown 
of the 2MASS population, and also a comparison to sev- 
eral other surveys. According to their estimates, the bright 
2MASS sample consists of a mixture of about half early 
and half late type galaxies, while APM and other blue se- 
lected surveys have 20% early 80% late type galaxies. This 
is likely the origin for the larger power i n the 2MASS sur- 
vey, e specially on small nonlinear scales. iBrinchmann et all 
( 1998) measured the CFRS galaxy morphologies with HST, 
and also find about equal early and late type galaxies in our 
magnitude range. The CFRS galaxies are selected by sim- 
ilar colours as VIRMOS-DESCART, so we expect 2MASS 
and VIRMOS-DESCART to yield similar results. The dif- 
ference in redshift distribution opens up some leeway, but 
our results are in general consistent. 

The observable statistics do not end at the two point 
function. The full three point function and windowed skew- 
ness for the d ark matter has been measured to better than 
10% accuracy JPen et al.l2003l : lBernardeau et alj2002T) . The 
cross skewness to luminous matter has 4 moments l|Penl 
1998), which are all directly measurable and provide ad- 
ditional constraints on galaxy formation models. 

We note at this point that our model for power spec- 
trum inversion was designed with dark matter evolution in 
mind, which is physically well understood from first prin- 
ciples. We naively applied this model to the galaxy and 
cross-correlation power using the same assumptions, that 
light traces mass and that the stochasticity is small. Our 
results obtained under these assumptions show that the as- 
sumptions are only partially true: the optical galaxies are 
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biased, and there is evidence for stochasticity. The galaxy 
halo parameter h was inconsistent with unity, ruling out a 
mass traces light model. When light does not trace mass, 
as we have found, our linear evolution model used to depro- 
ject the galaxies is not a unique interpretation of the galaxy 
power. One could have many different plausible mechanisms 
which can lead to the same observed data set, but with quite 
different underlying properties. Even within a magnitude 
range, one is measuring a mixture of nearby intrinsically 
faint galaxies and distant intrinsically bright galaxies. It is 
likely that these two populations have different power, and 
do not evolve by any simple parametrized model. In the fu- 
ture, photometric redshifts will allow separation of several 
of these effects, and give a systematic parametrized hierar- 
chical measure for galaxy formation. 



7 THE GALAXY-DARK MATTER 
CONNECTION 

Mathematically, all two point statistics encode the same in- 
formation. When we observe the distribution of dark matter 
and galaxies, all two point information is described by two 
power spectra and one cross spectrum. One can also con- 
struct derivatives of these quantities, and non-linear combi- 
nations, for example the bias and cross correlation coefficient 
shown in the previous sections. 

Historically, the paradigm to understand dark matter 
was not on this equal footing, but rather centered on visi- 
ble galaxies. One could measure the luminosity and number 
density of galaxies. A popular strategy was to attempt to 
measure the mass concentration associated with the light. 
The mass concentration is known to have a larger spatial 
extent than the light, which was parametrized as the ra- 
dial halo mass distribution which we call the "halo profile" . 
If one could measure all the mass associated to halos, one 
could in multiply the number density of galaxies by the mass 
of the halos to measure the mass in the universe. Of course 
there could also be mass that is not associated with visible 
galaxies, so that would still only represent a lower limit to 
the mass density of the universe. We can connect these two 
viewpoints, which are different interpretations of the same 
numbers. 

Several approaches exist to measure the halo mass. Per- 
haps the most direct is galaxy-galaxy lensing. One defines a 
halo profile H(r), and considers each galaxy to have a halo, 
such that the dark matter distribution is the convolution 
of the position of galaxies described by a density distribu- 
tion Pgai, which may be a sum of 5-functions. One further 
assumes that all dark matter is associated with such ha- 
los. Galaxies of different morphologies, luminosity or colour 
may have different halos, and one can do a full segregated 
measurement. The formalism remains the same, so we only 
consider one universal galaxy class. One then tries to fit 
for this universal halo profile. Apart from noise weights, all 
analysis proceed as follows. The distribution of "halo" dark 
matter is 



all such galaxies, one has cross correlated the dark matter 
field 13611 with the galaxy field, 



Phalo 



(Pgai(aj)pDM(a; + r)) 



(37) 



p ga ,i(x')H(\x - x'\)d 3 x' 



(36) 



If one sums the tangential shear to each galaxy, and stacks 



PgalPDM 

In the halo model, one equates phaio = Pdm- Formally, this 
yields the cross-correlation of the galaxy position with the 
associated halo mass. This is equivalent to Fourier trans- 
forming, and multiplying both sides of equation 1361 by the 
galaxy density, 

ALssW = A 2 sal (k)h(k). (38) 

We have absorbed the normalization coefficients into the 
dimensionless halo profile nh(r) = H(r) with n = Pdm/p 9 - 
Inverse Fourier transforming and applying equation 1331 we 
find 

Hir) = nh(r) = 4nn [ r{k ] sin(fcr) k 2 dk. (39) 
J b(k) kr 

The halo profile is mathematically equivalent to the Fourier 
transform of the cross correlation coefficient divided by the 
bias, and requires only measurements of the galaxy auto- 
correlation and the galaxy-dark matter cross correlation 
functions. It is the Fourier transform of our "galaxy halo 
parameter" h defined in equation 1331 . 

Formally, one could derive a mass from the halo profile 
H(r), and compare this to masses of halos. This interpreta- 
tion is parameter dependent, since the normalization in 1391 
depends on the mean density of galaxies, which depends on 
the flux limits of the survey. The deeper a survey looks, the 
more faint galaxies one finds, so the gravitating mass is di- 
vided by more galaxies. Similarly it is tricky to extrapolate 
1391 to obtain a value of the total cosmological density. One 
only measures the mass correlated with galaxies, and there 
could be more mass that does not correlate. I.e., if r < 1, 
the halo may underestimates total mass. 

Since all two point statistics are equivalent, measure- 
ment of "halo profiles" always measure the mass-light cross 
correlation. Fitting to a universal profile is mathematically 
equivalent to equation I39H . If one knows the mean num- 
ber density of galaxies, one could use this relation to in- 
fer the mean density of matter P dm = p ga i47r J h(r)r 2 dr = 
p ga i/i(fc = 0) iWilson et all 1200 ll) . We see that the results 
can under or overestimate the total matter, depending on 
the the correlation properties of galaxies and dark matter. 
For the fiducial Qo = 0.27 model used in our analysis, the 
VIRMOS-DESCART halo parameter is h = 0.57±0.I6, im- 
plying that about fihalo — htto ~ 0.15 is in dark matter cor- 
related with / selected galaxies. This is consis tent with the 
low inferred value s of fihaio in the literature (IWilson et alJ 
l200ltlSeliakll2002D . 

A popular interpretation of the d ark matter distributio n 
has been in terms of halo models (iGuzik fc Seliakll2002fl . 
When describing the distribution of galaxies, one also needs 
to specify the cross correlation of galaxies relative to these 
halos. These cross correlation coefficients must be calibrated 
to the observed values of the "galaxy halo parameter" h(r). 



8 CONCLUSIONS 

We have presented the first full optimal analyses of two di- 
mensional and deprojected 3-D power spectra of dark mat- 
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ter and galaxies. We used weak gravitational lensing from 
the VIRMOS-DESCART survey, and their relation to the 
galaxy distribution in the same data set. The survey is sen- 
sitive to 0.37 < k < 2.88/iMpc -1 and probes the regime 
of non-linear clustering. We have compared the results of 
three different deprojection procedures, and found that our 
method B based on partial deconvolution is simultaneously 
robust, has narrow window functions, and mostly uncor- 
rected error bars. Full inversion is unstable, as might be 
expected, and a cut with SVD results in a highly tangled 
window function. No choice of cutoff leads to a meaningful 
three dimensional power spectrum for this full inversion, as 
measured by the covariance matrix of the solution. For the 
galaxy power spectrum, the inversion is more stable, but 
still noisy. 

We tested the effects of incorrect model priors. While 
the true power is very non-linear, using a linear evolution 
model relative to the median redshift does not introduce a 
large error. For the dark matter power spectrum, the er- 
rors are dominated by the B-mode and shot noise. For the 
galaxy and cross correlation power spectrum the noise is sig- 
nificantly smaller, and redshift distribution errors dominate. 

The deprojected dark matter distribution is consistent 
with that expected from the standard WMAP A cosmologies 
with 08 = 0.9. The results agree well with deprojected RCS 
lensing data and the CMB linearly evolved power spectrum. 
The galaxy distribution is similar to the 2MASS galaxies, 
but more clustered than that found by APM and SDSS. This 
may be due to the different colour band selection. Using the 
cross correlation we confirm earlier results that galaxies and 
dark matter are indeed distributed stochastically on non- 
linear scales. 

We have quantified the bias and cross-correlation co- 
efficient b = 1.24 ± 0.18, r = 0.75 ± 0.23. A less noisy 
combination of the parameters that can be measured is the 
"galaxy halo parameter" h = r/b — 0.57 ± 0.16. These pa- 
rameters describe the relation between dark and luminous 
matter, and are the key uncertainties in the interpretation 
of galaxy-galaxy lensing. 

All error analyses used Gaussian assumptions. Most of 
the regime is noise dominated, for which Gaussianity is a 
good approximation. For th e sample variance t his poten- 
tially underestimates errors l|Hu fc White! 1200 ll) . We plan 
future analyses to quantify this effect using N-body sim- 
ulations. The upcoming Canada-France-Hawaii- Telescope 
Legacy Survey weak lensing survey should significantly im- 
prove on all measurements, and probe to larger scales. At 
larger scales one will be able to measure bias and stochastic- 
ity without being affected by finite field size sample variance. 
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